A POMDP Approach to Token-Based Team Coordination
نویسندگان
چکیده
Efficient coordination among large numbers of heterogeneous agents promises to revolutionize the way in which some complex tasks, such as responding to urban disasters can be performed. Token-based approaches have shown to be a novel and promising way for such coordination. However, previous token-based algorithms were built on heuristics and did not explicitly consider utilities related to token movements or changes in team states. In this paper we put forward an algorithm that uses team rewards to improve token routing decisions. The ideal solution of this token movement model is a centralized Markov Decision Process (MDP) with joint activity. Unfortunately, the assumptions underlying this model are not feasible for large team coordination and we have to make several approximations. First, we decentralize the centralized MDP as a set of standard MDPs with independent individual activities. Then this MDP is approximated by a Partially Observable Markov Decision Process (POMDP) because agents in a large team may not know the exact states of their teammates or that of the environment. A logical team organization is imposed to limit the token passing among one agent and its neighbors. Belief states of the POMDP model are efficiently estimated using Monte Carlo sampling process.
منابع مشابه
Token-based Approach for Scalable Team Coordination
Efficient coordination among large numbers of heterogeneous agents promises to revolutionize the way in which some complex tasks, such as responding to urban disasters can be performed. However, state of the art coordination algorithms are not capable of achieving efficient and effective coordination when a is very large. Building on recent successful token-based algorithms for task allocation ...
متن کاملCoordination Approach to Find Best Defense Decision with Multiple Possibilities among Robocup Soccer Simulation Team
In 2D Soccer Simulation league, agents will decide based on information and data in their model. Effective decisions need to have world model information without any noise and missing data; however, there are few solutions to omit noise in world model data; so we should find efficient ways to reduce the effect of noise when making decisions. In this article we evaluate some simple solutions whe...
متن کاملSolving Multi-agent Decision Problems Modeled as Dec-POMDP: A Robot Soccer Case Study
Robot soccer is one of the major domains for studying the coordination of multi-robot teams. Decentralized Partially Observable Markov Decision Process (Dec-POMDP) is a recent mathematical framework which has been used to model multi-agent coordination. In this work, we model simple robot soccer as Dec-POMDP and solve it using an algorithm which is based on the approach detailed in [1]. This al...
متن کاملCoordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach
Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative for the agents to be able to reason about the rewards (and costs) for their actions in the presence of uncertainty. However, finding the optimal distributed POMDP policy is computationally intractable (NEXPComplete). T...
متن کاملUvA Rescue Team Description Paper Agent competition Rescue Simulation League Iran Open 2014
This year’s contribution of the UvA Rescue Team is twofold. On one hand a theoretical contribution is made by describing the planning and coordination problem formally as an POMDP problem, which will allow to apply POMDP-solution methods in this application area. On the other hand the impact of the introduction of flying agents will be studied. Flying agents, when applied correctly, have the po...
متن کامل